首页> 外文OA文献 >The Paralax Infrastructure: Automatic Parallelization With a Helping Hand
【2h】

The Paralax Infrastructure: Automatic Parallelization With a Helping Hand

机译:视差基础设施:助力自动并行化

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Speeding up sequential programs on multicores is a challenging problem that is in urgent need of a solution. Automatic parallelization of irregular pointer-intensive codes, exempli?ed by the SPECint codes, is a very hard problem. This paper shows that, with a helping hand, such auto-parallelization is possible and fruitful. This paper makes the following contributions: (i) A compiler framework for extracting pipeline-like parallelism from outer program loops is presented. (ii) Using a light-weight programming model based on annotations, the programmer helps the compiler to ?nd thread-level parallelism. Each of the annotations speci?es only a small piece of semantic information that compiler analysis misses, e.g. stating that a variable is dead at a certain program point. The annotations are designed such that correctness is easily veri?ed. Furthermore, we present a tool for suggesting annotations to the programmer. (iii) The methodology is applied to autoparallelize several SPECint benchmarks. For the benchmark with most parallelism (hmmer), we obtain a scalable 7-fold speedup on an AMD quad-core dual processor. The annotations constitute a parallel programming model that relies extensively on a sequential program representation. Hereby, the complexity of debugging is not increased and it does not obscure the source code. These properties could prove valuable to increase the ef?ciency of parallel programming.
机译:加速多核上的顺序程序是一个具有挑战性的问题,迫切需要一种解决方案。 SPECint代码所证明的不规则指针密集型代码的自动并行化是一个非常困难的问题。本文表明,在帮助下,这种自动并行化是可能且富有成果的。本文做出了以下贡献:(i)提出了一种用于从外部程序循环中提取类似管道的并行性的编译器框架。 (ii)使用基于注释的轻量级编程模型,程序员可帮助编译器查找线程级并行性。每个注释仅指定编译器分析遗漏的一小部分语义信息,例如,指出某个程序点上的变量已失效。批注设计为易于验证正确性。此外,我们提供了一种向程序员建议注释的工具。 (iii)该方法适用于自动并行化多个SPECint基准。对于具有最高并行度(hmmer)的基准,我们在AMD四核双处理器上获得了可扩展的7倍加速。注释构成了一个并行编程模型,该模型在很大程度上依赖于顺序程序表示。因此,调试的复杂性不会增加,并且不会掩盖源代码。这些属性对于提高并行编程的效率可能是有价值的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号